Project C CSE 494 / 598 Hemal Khatri

نویسنده

  • Hemal Khatri
چکیده

CSE 494/598 Hemal Khatri Overview This goal of this project is to mine patterns in the search results by clustering the search results. The different methods used for clustering are: 1. K-Means 2. Buckshot 3. Bisecting K-Means. For each of these methods, we show the algorithm used for computing the clusters as well as the time complexity for each of these algorithms. We also compare the quality of the clusters across these methods using cluster similarity measure such as intra-cluster similarity and inter-cluster similarity. We also experiment with different number of clusters as well as using varying number of documents for clustering. A GUI is developed as a web servlet which displays the clusters to a given user query in a web page.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CSE 494 CSE / CBS 598 ( Fall 2007 ) : Numerical Linear Algebra for Data Exploration — Two dimensional SVD and PCA

• Let Ai ∈ IRr×c, for i = 1, · · · , n, be the n data points in the training set, where r and c denote the number of rows and columns respectively for each Ai. We aim to compute two matrices L ∈ IRr×`1 and R ∈ IRc×`2 with orthonormal columns, and n matrices Mi ∈ IR`1×`2 , for i = 1, · · · , n, such that LMiR approximates Ai, for all i. Here, `1 and `2 are two prespecified parameters that are be...

متن کامل

QUERY PROCESSING OVER INCOMPLETE AUTONOMOUS WEB DATABASES by Hemal Khatri

Incompleteness due to missing attribute values (aka “null values”) is very common in autonomous web databases, on which user accesses are usually supported through mediators. Traditional query processing techniques that focus on the strict soundness of answer tuples often ignore tuples with critical missing attributes, even if they wind up being relevant to the user query. Ideally, the mediator...

متن کامل

Accumulation profiles of PrPSc in hemal nodes of naturally and experimentally scrapie-infected sheep

BACKGROUND In classical scrapie, the disease-associated abnormal isoform (PrP(Sc)) of normal prion protein accumulates principally in the nervous system and lymphoid tissues of small ruminants. Lymph nodes traffic leukocytes via lymphatic and blood vasculatures but hemal nodes lack lymphatic vessels and thus traffic leukocytes only via the blood. Although PrP(Sc) accumulation profiles are well-...

متن کامل

CSE 190A Project Proposal: 3D Photography

This paper presents a research proposal for CSE 190A: Projects in Vision and Learning, Winter 2007, on the subject of 3D photography. It addresses the objectives, datasets, milestones, and student qualifications for the project.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005